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We present a fault-tolerant semi-global control strategy for universal quantum computers. We show that 
N-dimensional array of qubits where only (N-l)-dimensional addressing resolution is available is compatible 
with fault-tolerant universal quantum computation. What is more, we show that measurements and individual 
control of qubits are required only at the boundaries of the fault-tolerant computer, i.e. holographic fault-tolerant 
quantum computation. Our model alleviates the heavy physical conditions on current qubit candidates imposed 
by addressability requirements and represents an option to improve their scalability. 
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I. INTRODUCTION 



In recent years much effort has been devoted to the realization of what was first envisioned by Shor and collaborators [1]: 
a quantum computer. To achieve this one basically requires: a set of qubits, and the ability to prepare, manipulate (through 
gates) and measure quantum information stored in those qubits [ ]. In the face of errors one must also execute fault-tolerant 
quantum error correction (QEC) to enable large scale quantum information processing [ ]. For fault-tolerant universal quantum 
computing (FTUQC) [ ] one typically needs some high degree of parallelism or simultaneity. In particular, traditional schemes 
require (i) complete addressability, (ii) sufficiently low error rates for operations, and (iii) simultaneity of such operations. Such 
tough requirements imply that the experimental execution of fault-tolerant schemes will be technically very challenging. For 
example, ion-traps have fairly low error rates (~ 10~ 3 )) and good addressability but scaling ion traps to large number of ions 
where many gates can be executed simultaneously is highly challenging [ ]. On the other hand, neutral atoms trapped in optical 
lattices posses large numbers of qubits and also simultaneous control but individual addressing is very challenging [6]. In general 
there is a trade-off between the capability to perform individual addressing and large-scale simultaneous addressing. This leads 
us to ask the questions: which of the requirements (i-iii) can be relaxed and still achieve FTUQC with a reasonable threshold? 
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Relaxing the requirements may pave the way for implementations to scale up the number of qubits and achieve fault-tolerant 
quantum computation. 

In a previous paper [ ] we approached the problem trying to reduce the requirements on criteria (ii) the accuracy of operations. 
One can reduce, in principle, the addressing requirements using global control techniques. In global control, gates don't have to 
be individually addressed and one can achieve universal quantum computing using global operations. The preparation of specific 
quantum states can be replaced with a resetting process plus a gate, moreover the resetting process can be executed in parallel, 
greatly reducing the addressing requirements. Only measurements seem to require addressing. This apparent requirement poses 
heavy experimental constraints when one demands fault-tolerance: error correction in every logical qubit simultaneously, at 
every computational time-step, which means that measurement-aided syndrome extraction would require one to simultaneously 
and distinguishably measure the state of many qubits. Distinguishable measurements with good accuracy may be possible in 
small qubit registers but will be very challenging to scale up [ ]. Consequently, to avoid these and other issues, in Ref.[ ] we in- 
troduced circuits to remove measurements from quantum error correction protocols and greatly reduce the role of measurements 
in fault-tolerant universal quantum computing. Our results showed two things: (I) gates must be quite accurate while preparation 
and measurements can have error rates as high as 33%; (II) because measurements are removed, simultaneous and paralell FT 
QEC can be executed over many logical qubits. 

In this paper we explore deeper into the consequences of (II) and propose a novel concept with the potential to greatly alleviate 
not only the measurement-related issues mentioned above but also reduce some of the addressability constraints which typically 
burden fault-tolerant computer designs. We show for the first time that fault-tolerant holographic, i.e. addressing the boundaries 
only, universal quantum computing is possible. We will argue that such holographic design not only represents a way of reducing 
the number of individually addressable qubits but also inherits the results we obtained in Ref.[ ] regarding the tolerable high 
error rates for fault-tolerance, 

The paper is organized as follows. In section (II) we discuss the addressing required in typical FTUQC designs, and describe 
the schematic idea of a holographic, i.e. semi-global with boundary addressing only, control scheme which reduces the number 
of controls and amounts of addressing required while maintaining fault-tolerance. In section (III) we formalize the scheme and 
explicitly describe the necessary tools. In particular we describe fault-tolerant routines which make minimal use of measure- 
ments; these measurements can be done exclusively on the boundary i.e. holographically (or offline). We then show how our 
model can be mapped to a lower dimensionality array but now requiring additional next-to-nearest neighbor gates. We discuss 
(Section (IV)) the error threshold for this design and finally in section (V) we proceed to make an analysis of the resources 
required and show that the number of controls in our strategy has only a weak dependence with the number of computational 
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qubits, as opposed to traditional fully addressable architectures. 

H. A SEMI-GLOBALLY CONTROLLED QUANTUM COMPUTER 

We initially consider a 3D array of qubits with nearest neighbor interactions although we urge the reader to keep in mind that 
our real goal is to show that a N-dimensional, N = 2,3, array can execute fault-tolerant quantum when only (N-l)-dimensional 
individual addressing is available. We will first develop the 3D model and then, in section (III A), argue that the model can work 
in a 2D array with ID individual addressing. A 3D array of qubits makes efficient use of space, however in a 3D spatial array it 
will typically be hard to measure/manipulate individual qubits in the bulk of the array, i.e. 3D addressing resolution. Instead we 
assume a 2D addressing resolution, i.e. the ability to address lines of qubits in the array. We label indexed array of 3D locations 
by coordinates (x,y,z), where each coordinate s E [l,N s ], for s = {x,y,z} and label the addressable lines in the 3D array by (x,y). 
The action of a single-qubit gate addressing the line (x,y) is given by Uu y ) = Ylz U^^, while two-qubit gates between addresses 
(x,y) and (x 1 ,/) is given by V({x,y),(x',y')) = Hz ^(x,y,z),(^y,z) > w * m trie obvious generalization for multi-qubit gates. Approaching 
measurements this way is not practical as it does not allow one to discriminate individual qubit measurement results along z. We 
shall assume that all measurements are executed at the boundaries. In fact we allow the possibility of executing any operation 
on the z-boundaries i.e. 0(^,1) and Ouyfl) for any (x,y). The addressing limits described above impose a constraint on the 
type of gates we can execute in the z direction: for example, we cannot directly execute a gate of the form V{x^)( x , y ^)- We 
note that we allow long-range interactions within every z-plane but that one can restrict to nearest neighbor gates with a slight 
reduction of the fault tolerant threshold due to the introduction of intermediate SWAP gates. We require the ability to execute 
nearest neighbor CZ gates in the z direction along (x,y) columns. We will show that this limited addressability, where we can 
only address columns of qubits in the 3D array, yield universal quantum computing and, more importantly, fault-tolerance. 

A. Global control 

Lets consider the global control model introduced in [9] (which is closely related to [10]). In [9] the authors consider a one 
dimensional array of qubits, e.g. consider the single column x — 1 — y in our 3D array show that with the global operators 

cz= n cz ((i,i,z),(i,i,z+i)> ff =n%,y' w 

z=l z=l 

and non-Clifford gates executed at the edges of the ID array (z = l,N z ), universal quantum computation is possible within 
this ID array. Their basic idea uses the global gate T = CZ ■ H to propagate information in a controlled way within the ID 
array. T Nz+1 is equivalent to executing a spatial reflection of the information stored in the 1-D sub-array along the z-direction: 
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FIG. 1 : Schematics of the addressability requirements and qubit distribution of our semiglobal architecture. Vertical global pulses are capable 
of executing the single-qubit (green) and two-qubit gates (red) described in the text. Computation is achieved through the vertical nearest 
neighbor independent T pulses which can be decomposed into the two subroutines (black and brown T pulses), in virtue of the ABAB 
addressability, so as to ensure fault-tolerance. The end-planes (darker blue) are fully addressable independently of the bulk of the computer, 
and must have extra space to accommodate a \Hi) state encoded at the highest level of concatenation in order to execute the non-Clifford 
encoded gates. All planes plains contain enough qubits to hold an encoded qubit, the ancillas required for its unitary UQEC. We also require 
that every plane has physical qubits that can be reset (simultaneously in all planes) to use a resource for algorithmic cooling (simultaneous in 
all planes), note that the reseting operation assumes no single plane-addressability. 

fNz+i ■ p . _^ P(i,ijv+i- 7 ), where p is the density matrix of the array. By executing Clifford and non-Clifford gates at the 
boundaries of the ID array and between the r -pulses it was shown that universal quantum computing is possible. We refer the 
reader to Ref. [ ] for details. 

The problem with global control is its apparent incompatibility with fault-tolerance. The use of global pulses, usually in the 
form of nearest neighbor gates leads to two serious obstacles towards implementing fault tolerant QEC: (i) traditional global 
control techniques may give rise to correlated errors in a codeword, which although not necessarily lethal for fault-tolerance, are 
known to reduce the error correcting capabilities of a code [ ], and, perhaps the most relevant, (ii) the uncontrolled propagation 
of errors to multiple locations e.g. due to the interaction of a faulty qubit with others. 

Although some evidence for the existence of a fault-tolerant threshold using global addressing exists [14], no such threshold 
has been calculated to date. In any case, as global control uses many global operations to implement a single logical gate, 
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globally simulating traditional error correction circuits can only lead to a worse threshold. 

Rather than propose a fully globally addressed architecture we here propose to use a hybrid strategy where, while the address- 
ing requirements are reduced, fault-tolerance is still possible. The central concept of our hybrid design is to trap any possible 
correlated error so that it won't affect more than one qubit in every logical qubit (or more than one depending of the QEC code 
we are using, for simplicity we will assume in this paper distance 3 codes). More importantly, we restrict the direction in which 
errors can propagate, ensuring that correlated errors propagate within separate codewords. To arrange for this we consider a 3D 
cubical array of physical qubits, where each xy-plane contains a logical qubit encoded in a CSS code e.g. Bacon Shor code, 
Steane code, etc. The encoded T gate is then a set of vertical, nearest neighbor gates, in the z-direction which, given the choice 
of code, are transversal, i.e. bitwise, 

T = II CZ (x,y,z),(x,y,z+l) H (x,yz) = Yi ^ 
(x,y,z) (x,y) 

where f( x , y ) — Ylz^(x,y,z),(x t y,z+i)^(x,y,z)- We note that a single faulty physical f( x . y ), i.e. any combination of errors in the 

gates composing f( x , y ), can generate a correlated error, but such error will only affect one qubit in every plane, i.e. in every 

logical/encoded qubit. Furthermore, to avoid simultaneous CZ gates targeting or controlling more than one logical plane we 

execute T in two steps to get a fault-tolerant version, T, 

°~ e e ~ ° T — r 

^ = CZ (x.y) x CZ (x,y) x 1 1 H (x,y,z)i ( 3 ) 
(x,y,z) 

- — o—e - — -e—o 

where CZ^ = Yl{x,y,ze[i,lN z /2}])C z (x,y,2n-i),(x,y,2n) and CZ^ = Y[(x, y ,ze{i,lN z /2\})CZ(x !yj2n ) } ( X} y,2n+i)- This would require an 
ABAB plane addressability where we are able to execute AB and BA nearest (plane) neighbor operations. Moreover if we 
use the Bacon Shor code, the CZ is not completely transversal but requires an extra %/2 physical rotation of one of alternate 
(x-y) planes in the 3D array. To avoid this, particularly for the BS code we benefit from the ABAB plane addressability: (i) even 
planes will be encoded in the BS code while odd planes are encoded in a Jt/2 rotated BS code and (ii) even and odd planes will 
have independent error corrections according to the standard or rotated BS encoding. Other CSS codes, such as the Steane code, 
won't have this issue as the CZ gates are transversal in this code but will still require the ABAB addressability for the execution 
of the fault- tolerant 1 . 

B. Quantum error correction without measurements (UQEC) 

We now consider how to execute quantum error correction routines given the limited in-plane addressability. Our restriction to 
columnar semi-global operations forbid the the execution of parallel individual measurements along the z-direction. This would 
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indicate that parallel, measurement-aided, error correction and thus fault-tolerance is not possible. The straightforward solution 
is to remove measurements from quantum error correcting routines, and use suitably designed circuits instead. In Ref. [7] we 
introduced such unitary error correcting routines for the BS code (Fig.(2(c))) based on the so-called 5Vf-gate: a majority voting 
gadget (Fig.(2(a))). Although the specific design we use here for the M-gate is tailored for the BS code we stress the measurement 
free scheme below applies independently of this choice of QEC code (See supplementary material in Ref. [ ]). Armed with the 
appropriate unitary quantum error correction (UQEC) gadgets and certain trans vers ality properties (which we outline below) the 
semiglobal scheme can be applied to a variety of CSS [ ] codes. 

The error correction scheme used here is based on two routines: an error correction routine for the QR code, the 5Vf, and a 
mapping between CSS and QR codes, using a gadget we dub the f^-gate. The Bacon-Shor (BS) code is a composition of, X- 
and Z- base, quantum repetition (QR) codes defined by the following stabilizer set on a two dimensional array of 3x3=9 qubits: 

{xxi i x x z z z ^ ^ ^ 1 
xxi,ixx,zzz,zzz\. (4) 
XXI I X X III z z z J 

For this code the logical Pauli operators are given by X^ = Y[^=i^i.i'^L = IlLi^i,; modulo stabilizer operations, i.e. Xi (Zl) 
acts on a column (row) of the array. This code is a subsystem code and is invariant under pairs of X(Z) operators along any 
given row(column) because they act only on gauge degrees of freedom. Given the subsystem structure of the code one is able 
to correct acting on only one row (for X-errors) and on only one column (for Z-errors). An encoded 5Vf-gate provides a way of 
executing BS error correction. Here is where the i^-gate comes into play: as an interface between the BS and the QR codes. 
Let us explain how the UQEC protocol operates. For illustration lets assume we are executing the X-correction stage i.e. lower 
part of Fig. (2(c)). We can extract the syndrome information using BS encoded gates at any level of concatenation; then we use 
the i?\£-gate to take a BS encoded syndrome to a QR encoding. This is possible because the syndrome used in a particular error 
correction stage only needs protection against one type of error, e.g. during the X-stage, the syndrome must be protected against 
X errors, and not Z errors. After this step we have three syndrome strings, one for each column of the encoded 3x3 array, 
(sl,s2,s3). If we tried to correct every column we would destroy the gauge-freedom available for the BS code, so in order to 
maintain gauge freedom we use these strings to vote into a fourth 54 = s\ © S2 © S3, which will control the final correction. This 
step is achieved through the V9\[ routine described in Fig. (2(b)). Note that the S4 string is encoded in the QR code, so it may 
seem that we won't be able to execute the correction on a BS encoded state. However the interplay between CSS and QR codes 
is very interesting: a CNOT gate and more importantly a TOFFOLI gate can be executed using QR encoded controls and can 
target a subset of the CSS encoded state, e.g. targeting one column of the BS encoded state. To actually execute the 54 correction 
we just copy it with a cyclic rotation in order to execute the correction through a bitwise TOFFOLI gate. To see how the voting 
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FIG. 2: Measurement-free QEC routines for the QR and BS code. The inputs are |3(*)\ = |000}® 3 * ' and = |+ + +)® 3 ' '. (a) The 

M gate. An X-encoded majority voting gadget of level~(fc+ 1) of concatenation. Here all CNOTs are bitwise, i.e. each CNOT depicted 
corresponds to three CNOT(k), and subscript R corresponds to a cyclic fc-encoded rotation of the targets of the corresponding gate. In the QR 



code the TOFFOLI gate depicted is bitwise. The M gate can also be designed for a Z-encoded quantum majority voting, with + ) ancillas 
and the obvious Hadamard conjugation of gates. When the need to distinguish them arises we shall denote X and Z encoded majority votings 
and M^ z > respectively, (b) A subroutine acting on ancilla for processing error syndrome information extracted from the data. The circuit 
shows one row, (V9\£),(fc), of the fully contracted exRec V9\[(k) representing a collection of fc-level protected gates acting on row i of ancilla 
which take part in an EC(k+ 1) step. Note that in this circuit the output top lines are discarded so no EC gadget must protect them. With this, 
the exRec corresponding to i\£ at degree of concatenation k is = EC(k) x Xherows(^^Qi(k~ 1) x EC(k). In our circuits LG(£)J denotes 
the implementation of gate G, in terms of level- (k—1) gates, without the prepending or appending EC(k) . ( c) Full error correction ( EC) gadget 
for the BS code. The orange and pink boxes represent the syndrome extraction stage. Here, a TOFFOLI with o controls is a Z — TOFFOLI; 



= n? f-i CNOT f W . . is a set of transversal CNOTs, CX ( P ±] = FT; , , CNOT>*>. . w , . . , and CX, r f, = FT; ,_, CNOT r w . . , , w . . . The 
LL 'J=l (c,ij),(t,i,j) Pr li 'J=l (c,i,;),(f,i,;±l) (±) 11 /J=1 



l (c,U±l),(t,iJY 



control of the gates in boxes is always the top input of the gate. The W gate is a wait (identity) gate, and the last gate on the upper and lower 
half is a transversal bTOFFOLI. 



respects the gauge freedom consider the following scenarios: a single error in e.g. column one, leads to .54 = s\, which would 
correctly execute the correction by virtue of the gauge freedom; on the other hand a gauge-like operation, two X-errors in the 
same row, leads to 54 = s s = 0, which correctly implies an identity correction operation. An analogous analysis holds for the 
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Z-error correction. This completes our schematic description of the BS UQEC gadget. 

To complete a UQEC scheme capable of achieving fault-tolerance we must provide fresh ancillas in every (x—y) plane. Fresh 
ancillas can be generated via a semi-global reset, which in contrast with measurement, does not need to output a result and thus 
imposes no addressability constraints. A noisy version of this operation with fails with probability />/_n can be modeled by 

P->(1 l°>< l+W> < 5 > 
where p is an arbitrary density (of a qubit in each x-y plane) matrix ideally mapped to |0) (0| state, or to an arbitrary density 
matrix r| with probability p/ p \ . If we require an even lower error rate, then we can further use an algorithmic cooling protocol [7] 
in parallel to distill colder |0) states and effectively reduce it, p/p\ — > p/p\ < p/ p y With this fresh ancillae we can also unitarily 
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correction at every level of concatenation, using the following routines. 
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and + y: Using three copies of |0) we execute and 5Vf( x ' gate to ensure one has a 
the 9Vf gate is taking the role of the error correction gadget. This routine can be concatenated any number of times to get 
OW \. To get a Z-encoded QR code we instead start with three copies of |+) and execute a fM ^ gate. 



• Preparation of 
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- L /.By starting with a 3 x 3 array of |+) of physical qubits, and applying a 5Vf ^ in every 
column we can prepare a Similarly \0i) is obtained by starting with a 3 x 3 array of |0) and executing a fM^ z 

in every row. This process can be concatenated to fault-tolerantly create encoded 
concatenation of concatenation k. 



Of) or 



f ) states at any level of 



So this allows us to continuously execute massively parallel unitary fault-tolerant quantum error correction on all logical planes 
which provides a fault-tolerant memory. 



C. Error considerations 

So far we have described a way of executing simultaneous UQEC in every plane of the array. However to truly achieve 
fault-tolerance we must have a consistent error model which takes into account that the vertical pulses may introduce correlated 
errors. We are considering every vertical pulse as a single error location: each possibly faulty vertical addressing pulse admits 
any error, correlated or uncorrelated, in any of the qubits it addresses. The semi — global character of our design demands 
also that: errors induced by an s-qubit semi-global operation addressing columns {(xi,yi), (x s ,y s )} are independent of errors 
induced by an s'-qubit semi -global operation addressing columns {(x^/J, (x' s ,y' s )}. Thus in practice we have an adversarial 
(local) stochastic error model in every plane (in two dimensions). 
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When we extend our dimensionality, then every s-qubit gate addresses s strings of N z qubits. Let us now assume that our semi- 
global columnar controls are not applied homogeneously along the individual columns and model an inhomogeneous version of 
gate a U, acting on some column, as U = UY\ z e lQzHz , where e'® zHz accounts for a potential inhomogeneity of the pulse generated 
by H z , and H z indicates that each location z of the column may undergo a different undesired rotation. Clearly a gate can fail 
in other ways due to the coupling of the qubits with their environment, but in general we can parametrize such error by some 
parameter Q z . So we can model an error in a semi-global s-qubit gate as the error implied by the stochastic error model on a 
N z x s qubit columnar gate. For example, a single qubit semi-global gate can be modeled as a A^ z -qubit physical gate where we 
assume any type error is possible: single qubit errors, unwanted correlations arising among the N z qubits, etc. Now the condition 
for fault-tolerance, under the stochastic error model, is translated into a constraint for every z, i.e. that Q z is below some threshold 
Vz, i.e. in every plane. Note that these sorts of errors are potentially correlated. However we note that these correlations are 
spatially bounded, i.e. in the case of a s-qubit semiglobal gate, are always confined to s-qubits in every logical plane, which 
makes our model effectively a 2D local stochastic adversarial error model. We next extend these error considerations to the 
vertical CZ( 0_e ) and CZ' e ~°) operations we use to execute universal quantum computation, we assume that an error in T( x . y ) is 
independent of an error in T^yy 

As should be expected, our concept to encode separate logical qubits in separate x-y planes to protect them from correlated 
errors will eventually fail if the error rates are high enough. When enough errors accumulate such that one qubit at the highest 
level of concatenation is compromised, then all logical qubits at this level of concatenation in the register are compromised 
since they will eventually couple to it via the spatial reflection protocol, and the computation fails. This may seem a deal 
breaker but this is no different to what happens in individually addressed fault-tolerance. To show this let's consider a quantum 
computer running a specific circuit: if at some point enough errors accumulate such that one of the logical qubits at the highest 
concatenation level is compromised, then whatever circuit you operate with this logical qubit will be faulty because of error 
propagation. Thus loosing one or all logical qubits at the highest level of concatenation to accumulated errors is equivalent. 

Additionally, since we plan to use the semi-global control scheme to perform a computation, every logical gate (at the highest 
concatenation level) is a sequence composed of at least A(N Z + 1) T pulses plus, non-Clifford gates executed at the ends (see 
next section for details). Thus the computational size of any given circuit we want to simulate is increased by a factor ~ 4(AT Z ), 
i.e. its' complexity is increased by one order in N z , as compared to the circuit with full addressability. This, as we will see below, 
has consequences, not in the threshold value, but on the degree of concatenation required to achieve some fixed accuracy in the 
outcome of a given quantum circuit to be simulated. 
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III. HOLOGRAPHIC FAULT-TOLERANT UNIVERSAL QC 



The previous section described how to execute fault-tolerant quantum error correction using semiglobal control and a compat- 
ible error model. Using the scheme in [9], we can achieve universal QC, if we can execute non-Clifford gates at the z end-planes 
of our 3D array (z = 1 and z = N z ). So, under the error considerations and using the unitary quantum error correction tools 
described above, if we can execute encoded fault-tolerant Clifford operations at the boundaries we will achieve holographic 
semi-global fault-tolerant universal quantum computing via 1 pulse sequences. Note that after each of the stages in the 1 
steps (see (3)) we are able to execute unitary quantum error correction. We show now how to achieve the execution of the 
non-Clifford gates at the end z-planes. Recall that we allowed the execution of any operation at the boundaries of the array, 
including measurement, thus we can execute the following routines which will yield the sought fault-tolerant universal quantum 
computation: 

i. Z 1 / 4 (non-Clifford) gate aided by encoded measurements.- To execute a Z 1 / 4 gate we use the circuit in Fig. (3(b)). This 
circuit assumes the ability to prepare a Magic state ancilla, \Hi) — (|0i) +e' n / 4 \l£))/-\/2, at the highest level of concate- 
nation with an error rate of the same order of the gates as such level of concatenation. To do so we use a two step routine. 
First we encode a physical \H) state and use the encoder circuit (described below in (iii)) to get \Hi) . This encoded ancilla 
will typically have an error rate higher than the physical error rate and thus is not yet useful. However it can be shown [7] 
that if the effective error rate of such preparation is below pfj-ancs — 14.6% then they can be used as a resource for magic 
state distillation [16] (MSD). The MSD protocol, composed exclusively of Clifford operations, has two requirements: (i) 
Clifford operations are perfect, an approximation justified becasue we are executing encoded gates at the highest level of 
concatenation L and thus Clifford gates can be made arbitraly noiseless provided we choose L large enough, and (ii) a 
source of noisy copies of \H£) ancillas with an error rate below sin 2 {%/%) ~ 14.6%. 



FIG. 3: These circuits need only be implemented at the highest level of concatenation, and thus all operations depicted are encoded operations. 




(a) 



(b) 



(a) Circuit used to execute an encoded Z 1 / 2 gate on an arbitrary input [12] . (b) Circuit implementing the non-Clifford operation Z 1 / 4 



given the encoded resource magic state \Hi) . 
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ii. Z 1 / 2 (Clifford) gate aided by encoded measurements.- Clifford operations can be generated by the gate set 
{CNOT,H ,Z 1//2 }. CNOT and H are transversal, and the Z 1 / 2 gate can be implemented using the circuit in Fig. (3(a)), 
provided one can prepare a logical ancilla in |±j'l) = (|0l) ±i | lz})/ V2- Since the Z 1 / 2 gate is not part of the EC routines, 
it is only needed at the highest level of concatenation. There are two ways of implementing it: (a) since it is the only 
complex gate, it can be shown that by always using the same logical ancilla prepared in |0l) = 1 /v^(|+i'i) + |— z'z,)) to 
activate the circuit in Fig. (3(a)), then the entire quantum computation splits into two non interfering paths (evolution by 
Ucomp and U* omp ) and the measurements of real, Hermitian operators at the end have the same expectation values as for 
evolution by U comp alone [12]; or, alternatively, (b) one can use the distillation circuit in [ ] at the highest level provided 
one can prepare noisy with an error rate below pti- mc \ = 1/2. 

iii. Encoding of arbitrary ancillas.- Both (i) and (ii) use the preparation of particular encoded states, but they do not require 
them to have an extremely low error rate. So we will show now a routine to encode any desired state to any level of 
concatenation, we showed in Ref.[ ], and summarize in section (IV), that the error rate of the resulting encoded state p anc 
is below the required values P(H-imc) ar, d P(i-anc) an d mus can be used as a resource in the routines described above((i.) 
and (ii)). Thus to encode an arbitrary state we use the following algorithm: (i) we start with the level-0 state |(j)) we want 
to encode and 8 |0) states, then (ii) we use CNOT gates, including waiting times such that never in one step does one qubit 



interact with more than one qubit, to create the state 



4»>, = a 

3x3 



0) +b 

I 3x3 



1 ) . Finally (iii) we execute a fM ^ gate 



3x3 

in every row, to create the state = a \0i) +b\li). We can recursively use the same algorithm to create the state at any 
level of concatenation k. 

Thus combining the above elements (i)-(iii) we can execute any Clifford and non-Clifford encoded operation in the boundaries. 
This, in addition to the encoded T, yields a method to execute fault-tolerant semiglobal universal quantum computation. 

iv. The last element of our scheme is the readout stage: to extract the result of a computation. The challenge is to perform 
the readout without violating the addressability constraints. To do so we use the above described semiglobal FTUQC 
to execute SWAP gates at the highest level of concatenation to move any (x — y) logical plane to one of the end planes 
(z = l,N z ) where measurements are allowed. We can repeat this process until all the desired logical qubits are readout. 



A. Restriction to two dimensional arrays 



The tools presented above were based on a three dimensional array of qubits. Despite being natural for many physical systems, 
it is not even possible for many others. For these latter systems a 2D architecture would be more desirable and we now discuss 
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such a structure. We can start by reducing the dimensionality of every plane to a ID linear array such that the 3D array is reduced 
to a 2D planar array. Although this is in principle possible the question is whether fault-tolerance is respected. This depends on 
the interactions at our disposal in the dimensionally reduced architecture. After the reduction of dimensionality, we now require 
that we can execute fault-tolerant EC in every line: this is not possible in general if we restrict ourselves to nearest-neighbor 
interactions only [17], although see [18] for a special case, but is always possible if we also allow next-to-nearest neighbor 
interactions. We essentially need to show that a SWAP gate between two qubits containing data/information can be executed in 
a fault-tolerant way, i.e. such that one gate error does not generate more than one error in the data qubits. 

To achieve a fault-tolerant swap gate we introduce place-holder qubits, that is qubits that we require specifically to physically 
move information around but which hold no computationally valuable information. To differentiate them from place-holder 
qubits, we shall label the qubits involved in the computation, i.e. data plus ancilla qubits, information holding qubits, i.e. info- 
qubits. Consider a one dimensional line of /n/o-qubits, for example encoding a logical qubit, denoted by p, with interspersed 
placeholder ancilla qubits, denoted by r|y, in between every nearest neighbor pair of info qubits (e.g. any horizontal line 
of Fig. (4)). Using nearest neighbor and next-to-nearest neighbor interactions we can now SWAP two info qubits containing 
relevant information (p and p respectively), in such a way that a single failure in a SWAP gate does not generate two errors in 
the info qubits, through the following routine 



Step 


Gate 


Output state 







Pi ®r|2«)p3®r|4 


1 


SWAP U2 


T|l<8>p2<8>P3 ( 8>p4 


2 


SWAPi3 


Pi ®P2®r|3®r|4 


3 


SWAP 2 , 3 


Pl®Tl2®P3®Tl4 



where the subscripts in p and r| denote the physical locations in the sample four qubit chain. This routine executes the fault- 
tolerant SWAP gate between site 1 and site 3, since qubits containing the relevant information (p and p) never interact directly 
and a SWAP gate does not propagate errors. Since we can reproduce this structure in a longer chain and achieve SWAP gates 
between any two qubits in a chain it is possible to execute fault-tolerant semi-global quantum computation on a 2D array with 
a reduced number of controls. We must have in mind that, although the threshold value will be reduced because of the need to 
swap qubits around to execute the desired gates, a threshold value does exist [17]. It will still remain true that preparation and 
measurement can be very noisy, as the argument behind such result is that gate error rates are sufficiently below the threshold 
value. Moreover, the advantage in terms of number of controls is also maintained in this reduced dimensional design. 
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FIG. 4: Schematics of the two dimensional architecture. All nearest neighbor interactions are required but only next to nearest-neighbor 
interactions between odd qubits, i.e. the equivalent to nearest neighbor between info-qubits, are required. This "cross-free" structure of the 
interactions would be beneficial for instance in solid-state implementations since wires would not need to cross. Each line can also be viewed 
as a one dimensional of 2-type of qubits (ABAB addressability), where only AA, BB and AB nearest neighbor interactions are required. Recall 
that in the logical direction we require ABAB addressability to ensure fault-tolerance. 

IV. HOLOGRAPHIC SEMIGLOBAL FAULT-TOLERANT THRESHOLD 

Let us now discuss the required error rates to achieve fault-tolerant holographic semi-global universal quantum computation. 
Because we are using the tools developed in Ref. [ ] we have essentially the same threshold analysis. There we assumed that 
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FIG. 5: Decomposition of a physical TOFFOLI gate into two qubit interactions when one desired to constraint the problem at most two-qubit 
interaction only. Note that it requires only three time-steps as the first two CX 1 ' 2 s can be executed simultaneously. The last CNOT gate is not 
necessary as we will typically discard the controls of such TOFFOLI and thus they do not count towards our threshold estimation. These gates 
are not encoded gates but always physical gates. 
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at the physical level we had at our disposal three qubit gates, in form of TOFFOLI gates. We showed that armed with three- 
body interactions (in every plane) one can achieve fault-tolerant universal quantum computing if gates and preparation have 
error rates below p(p tg )thresh = 3-76 x 10~ 5 . Measurements are only required at the boundaries and only at the highest level of 
concatenation, and thus fault-tolerance is possible when measurement error rates are below P[ m ) t hresh = 1/3- If we implement 
a gate library at the physical level which does not include the TOFFOLI, then we decompose the TOFFOLI into one and two 
qubit gates, as in Fig.(5). In this case the threshold value for gates and preparation becomes pu v \ t hreshoid = 2-68 x 10~ 5 again 
with measurements as noisy as pt m ) = 1/3. 

Furthermore, we showed in [ ] that provided that the gate error rate ipu)) was sufficiently below p ( g ) t hreshoid tnen one c °uld 
relax the demands on measurement and preparation error rates, (p( m )) and (P( p )) respectively. Following that argument, and 
assuming our gate library contains the TOFFOLI, we find that if p^ = 1.3 x 10~ 6 , p^ = 1% and = 33% yields, with 
e.g. k — 6 degrees of concatenation, and effective error rate at concatenation level k = 6, p' 6 ' ~ 10~ 13 and and effective error 
rate for the output of an encoded state, p a 6 J c — 8.3 x 10~ 3 . Note that plfnc, denoting the ouput error rate of the \Hi) encoder 
circuit, is safely below the 1.46 x 10 needed to execute magic state distillation and achieve fault-tolerant universal quantum 
computation. 

This implies that the massively parallel mapping required to refresh the ancillas in our design can have error rates p/ p ^ as 
high as 1%, or even 33% at the expense of demanding even lower values from pr g s (~ 10~ 6 ). To accommodate more physically 
realistic interactions and assume only nearest neighbor interactions in the xy planes then one can expect a decrease of the 
threshold. However this is a characteristic shared by any addressable or non-addressable fault-tolerant scheme, and previous 
work has shown that by restricting a long-range addressable fault tolerant scheme to be nearest-neighbor can decrease the 
threshold by less than an order of magnitude [ ]. Further, we observe that the dynamical decoupling (DD) protection of gates 
still applies, and such DD protection can be made compatible with our reduced addressability [20]. It follows that the extra 
demand placed on gate error rates(/?( g ) <~ 10~ 5 — 10~ 6 ) can in principle be greatly alleviated by open quantum system control 
techniques [21]. 

V. GENERAL STRATEGY 

The particular strategy to achieve semi-globally addressed fault-tolerant QC we presented above may not be unique, so here 
we want to summarize what are some of the essential requirements of our design. Our scheme relies on two properties: (i) every 
plane can execute error correction in a fault-tolerant manner, and (ii) the z direction is in charge of the computational aspect of 
our array via some interaction capable of coupling different planes. We have chosen the 1 pulse, but in principle other scenarios 
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inspired by other global control strategies could be implemented. 

Lets say we have a global control scheme in one dimension with some (not necessarily nearest-neighbor) interaction T. We 
consider a 3D array as a collection of 2D collection of ID arrays, such that at every ID array we can execute 1 independently. 
Every horizontal plane of our 3D array will constitute a logical qubit, encoded in some QEC code with a set of stabilizers }, 
and the necessary ancillas for its measurement-free error correction. This means that our computer will be initially stabilized 
by (^^{S 1 ,- }. This fixes a relationship between 1 and {S^ }. We must guarantee that (i) at all times all logical qubits in the 
computer must be stabilized by some (possibly changing) code {S^ }, i.e. 1 never leaves the information unprotected, and 
(ii) 1 can propagate an error in some plane to only one qubit in any number of planes (nearest neighbor interactions fix the 
propagation of errors to nearest neighbor planes). The above requirements will be enough to achieve parallel FT UQEC in all 
qubits and transport of information. Extra requirements for fault-tolerant universal computation depend on the type of global 
control scheme one considers. 

A. Scaling of the number of controls 

To see that the semi-global architecture saves on resources we now investigate the spatial addressing efficiency of our design. 
We first count the number of controls required to execute a fault-tolerant semi-global quantum computer architecture and com- 
pare it with a fully addressable quantum computer architecture simulating the same quantum circuit to the same overall accuracy. 
As a first step, we assume the same measurement free EC routines but with full addressing, since we really want to know if we 
gain something using the 3D layout instead of just using a 2D one, and compare the number of controls required (N[uAdd\ an d 
N[ sg ] respectively) . We then proceed to compare the 3D layout to a fully addressable, measurement capable architecture using 
the same QEC code (which will be labeled by [mAdd]). 

Defining the parameters: Nc logical/computational qubits encoded using a QEC code which encodes each logical qubit into 
Nec physical qubits with k degrees of concatenation, and EC gadget using Na encoded ancilla qubits and Nb classically encoded 
ancilla qubits. A naive count shows that the number of controls in the fully addressable measurement-free (labeled with [uAdd]) 
and semiglobal measurement free (labeled with [sg]) architectures are given by [28] 

N [u Add] = N c x [(Nec+Na) x 9*t""fl -1 +N B x 3'l'^r 1 ] (6) 
N [sg] = (Nec+Na)*^- 1 +N B x3 k ^r l (7) 

Using the EC protocols described before, Nec — 9,Na = 18 and Nb = 6. From (7) it would seem that N[ sg ] does not depend on 
the number of computational qubits, and that N^ U Add] /N[sg] = Nc, however that is not the case as k^ is a function of Nc, albeit 
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with a weaker dependence. To see this we consider the result of the threshold theorem [4]: given a circuit which we wish to 
simulate to an accuracy e, whose size is a polynomial in the number of computational qubits, f(Nc), then 



P {k) xf(N c ) < e 
(Ap^f/A < e/f(N c ), 



(8) 
(9) 



where = A(p( k ~ l >) 2 is the error probability of an operation at level k of concatenation in terms of level-(k-l) error rates and 
A counts the number of pairs of possible level- (k— 1) errors (in the largest exRec defined in [13] as a gate with appended and 
preprended EC(k) routines) 

]o „ / <i/P,l,re S l, \ 

10g V f[-](Nc) J 



kn = \0g 



(10) 



log (pW/pthresh) 

where we have emphasized the dependence of f(N c ) on the design, i.e. for [■] = {[sg], [uAdd], [mAdd]}. This equation gives 
real values whenever concatenation is not harmful. Assuming that the polynomial scaling of the circuit size with the number of 
computational qubits is with power t, i.e. f[ U Add]{Nc) = $N' C , we have 

log(4JV c ) 



5k = k[ sg ] - k]^^ = log 2 
= log 2 



1 



1 



log (f[uAdd] (Nc)pthresh / e) 

log(JV c )+log4 



(ID 



logAfc + logCpA^ l pthre S h/e)) 

In general, because of the discreteness of k we will have an effective discrete difference between degrees of concatenation Ak>0 
instead of a continuous value for bk. This discrete difference oscillates as the number of computational qubits increases. In any 
case we will have that 

(N EC +N a )x 9V<«] +N B x 3 k l" AM i 



N[uAdd]/N [sg] = N c - 
= N C : 



[N EC +N a )x 9 k M +N B * 3*[» 
1 i 2 + 'i 2+ H uA M \ 



(12) 
(13) 



3 M I 2 + 3 2+<:i Jg ] J ' 

where Ak — \k[ sg ] \ — \k^ U Add]\- So in general the semiglobal architecture will have a gain °= Nc in the number of controls, i.e. 
N[ U Add] /N[ sg ] > 1 ■ More generally, we determine that there is a significant advantage, i.e. Ak < 1 => N^^/N^ = Nc, whenever 
log4 < log (PAT^T WW e )- 

As an example, let us consider the Shor factorization algorithm. To factor aN = 768 bit integer would require Nc = 2N + 4 = 
1540 qubits, and a circuit size of 8A^ 4 = 2.8 x 10 12 logical gates, if we want a 97% overall success rate, using p^ = 10~ 6 < pthresh 
we get that Ak = 1 and thus we gain factor of N^ u Add]l^[sg\ ~ 40.6 x 10 6 /19845 = 768 reduction in the number of controls 
controls. A N=2048 string would yield Ak = and thus a gain of N[ uAdd ] /N[ sg ] - 4.59 x 10 6 /19845 = 2048, while a 4096 string 
yields Ak = 1 and N [uAdd] /N [sg] - 81.3 x 10 6 /T9845 ~ 4096. 
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We can go a step further and compare a fully addressable model with error correction gadgets that admit measurements with 
our semi-global unitary QEC model. We will consider the gadgets and threshold values for our same QEC code obtained in 
Ref.[13] with a corresponding set of parameters N' A = 18, p ''^reshoid = 1-2 X 10~ 4 . Because such EC gadgets would typically use 
a different number of ancillas N' A < N A and have different threshold value v'threshaW we S et 

(N E c+N' A )x9 k '- 1 

N, nAdd] /N [sg] =N C V f C , A > ■ . (14) 

(Nec+Na) x9 M +iV fl x 3 M 

with 

W = k [sg] -kf (15) 

= _ 1q [" l0 ^(p'rhre S l,f',nAdd})) y log(p {0) / Prhresh) 
2 [l0g(e/ '(pthreshf[mAdd])) log(p(°) '/ 'p'thresh) 

The difference with the previous case is that now Ak' is more likely to be 1 and not 0, as follows from the observation that 
typically k' < k]uAdd\. Assuming we have a physical error rate good enough for both threshold values, pthresh an d p' t hresh> 
e.g. p^ = 1 x 10~ 6 , we get for Shor's protocol on the strings used in the previous example, Ak' — 1 and A^^i /N[sg] 1 768 ~ 
1.68 x 10 6 /19845 - 84, for 2048 Ak' = 1 and thus N' [mAdd] /N [sg] | 2 048 4.48 x 10 6 /19845 - 225, and for 4096 Ak' = 1 again and 
we get N'^ mAdd ^/N[ sg ]\4096 ~ 8.96 x 10 6 /19845 ^451 which implies a slightly smaller gain than in the previous case, but still 
significant ~ ■^c( 9M / +3M /i /t / +l x2 p )■ Even compared to gadgets where measurements are allowed the semiglobal architecture 
yields a 0(Nc) gain in terms of the number of controls. 



VI. EXPERIMENTAL CONSIDERATIONS 

In terms of an experimental realization, given a 3D array of qubits, the addressability requirements can be translated into : 

i. Massively parallel nearest neighbor CZ gates in the z direction: Y[(x,y) ^(x,y) ■ 

ii. Single-qubit (horizontal) gates: Ur x>y \ = Y\zU{x,y,z) f° r an y (•*>)') 

hi. Two-qubit (horizontal) gates: V^y),^',/)) = YhY{x,y^),{x! ,y ,z)> f° r an Y ( x ^y) an( ^ iV) nearest neighbors in a 2D lattice, 
iv. Massively parallel resetting: p — > |0) (0| 

One candidate technology for quantum computation well suited to holographic control is neutral atoms trapped in a three 
dimensional optical lattice. In this architecture one has the advantage of massive parallelism but since the lattice spacing is 
typically on the order of an optical wavelength it is difficult to accommodate imaging lenses that could resolve individual 
qubit measurement outcomes in the bulk. To get some idea of the size of a computation that could be realized in such a 
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system, in Ref. [22] the authors find that 10 6 133 Cs atoms could be trapped in a 100 x 100 x 100 blue detuned lattice with 
an achievable single qubit gate error rate of 10~ 5 using Raman based gates. Here the lattice spacing would be 10/jm and it 
has already been demonstrated [ ] that single ID tubes of 87 Rb atoms trapped in a 2D lattice can be addressed with better 
than 1/jrn resolution. Furthermore, using an architecture such as the 3D retroreflected lattice used in [25] it is possible to have 
ABAB type addressability along one or two dimensions enabling addressability of every other plane. If we make the reasonable 
assumption of a single qubit reset error rate of 10~ 5 and the (extremely optimistic) assumption of the same error rate for two 
qubit gates, then restricting to nearest neighbour interactions, a computation with ~ 10 5 sequential logical gates could be 
achieved on 100 logical qubits using holographic control with an overall circuit simulation error of 1/3. This would require 
3 levels of concatenation which could be accommodated in each 100 x 100 plane with room left over for resettable ancilla to 
shuttle quantum information. Massively parrallel two qubit CPHASE gates have been realized in optical lattices but with rather 
low fidelity [24]. The best reported two qubit gate using parallel exchange blockade mechanism between neighboring trapped 
atoms in an optical lattice realised an error for the VSWAP gate of 0.31 [ ] though theory predicts error rates as low as 1% 
could be achieved [26]. There are several other proposals for high fidelity entangling gates in optical lattices, e.g. using fast 
Rydberg gates [ ], but to achieve an error below our threshold would likely require another approach such as using dynamical 
decoupling pulses to boost the effective two qubit gate fidelity [21]. 

We note that the energy required by a control pulse increases with the number of x-y planes it must control and this may pose 
a limit to the size of the computation the architecture may implement. However if the physical model permits, one could place 
several several logical qubits per plane, i.e. one plane = several tiles of logical qubits, such that a pulse with limited addressing 
capacity can still be used to build an, in principle, arbitrarily large computer with the tools described in this paper. Universality 
follows form the fact that we can achieve SWAP gates between tiles within different planes and thus execute any two logical 
qubit gate. 

VII. CONCLUSIONS 

We have shown that in a N-dimensional (N = 2,3) qubit array fault-tolerance is achievable when only (N — 1) -dimensional 
addressability and fixed short-range interactions are available. The scheme has implications on the design of scalable quantum 
computers and shows an advantage in the number of controls required to manipulate the array of qubits. More specifically 
the number of controls required depends only weakly on the number of computational qubits, as opposed to fully addressable 
designs where they grow linearly, or equivalently we have a gain factor 0(Nc) in the number of controls. The design is suitable 
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for 3D optical lattices, and for 2D arrays with only nearest and next-to-nearest neighbor couplings. 
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